AN EQUIVALENCE BETWEEN INVERSE SUMSET THEOREMS 
AND INVERSE CONJECTURES FOR THE U 3 NORM 



BEN GREEN AND TERENCE TAO 

Abstract. We establish a correspondence between inverse sumset theorems (which 
can be viewed as classifications of approximate (abelian) groups) and inverse theorems 
ON . for the Gowers norms (which can be viewed as classifications of approximate poly- 

nomials). In particular, we show that the inverse sumset theorems of Freiman type 
04 | are equivalent to the known inverse results for the Gowers U 3 norms, and moreover 

that the conjectured polynomial strengthening of the former is also equivalent to the 
polynomial strengthening of the latter. We establish this equivalence in two model 
■ settings, namely that of the finite field vector spaces W%, and of the cyclic groups 

Z/NZ. 

In both cases the argument involves clarifying the structure of certain types of 
[■ J , approximate homomorphism. 

1. INTRODUCTION 

Approximate GROUPS. The notion of an approximate group has come to be seen 

as a central one in additive combinatorics. Let K ^ 1 be a parameter (the "roughness" 

CO ' parameter), and suppose that A is a finite subset of some ambient abelian group G = 

(G, +) (such as the integers Z). We say that A is a K -approximate group if A is 

Q\ ' symmetric (that is to say — x £ A whenever x £ A) and if the sumset A + A : = 

O ' 



{a + a' : a, a' £ A} is covered by K translates of A. Thus, for instance, the arithmetic 
progression {— N, . . . ,N} in the integers Z for any ^ 1 is a 3-approximate group, 
while the 1-approximate group are nothing more than the finite subgroups of G. 

The basic theory of approximate abelian groups was developed by Ruzsa in several 
papers [20], [2TJ [22] ; see also [21] for some extensions to non-abelian groups. 

Perhaps the most basic question to ask about an approximate group is that of the 
extent to which it resembles an actual group. A language for formalising this was 
introduced by the second author in [2S], and in the abelian case it reads as follows. 
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Definition 1.1 (Control). Let A and B be two sets in some ambient abelian group, and 
K ^ 1. We say that B i^-controls A if \B\ ^ K\A\ and if there is some set X in the 
ambient group with \X\ ^ K and such that A <0 B + X . 

Two of the landmark results of additive combinatorics may be stated in this language. 
The first of these may be found in [22] and the second in [1] , a paper which builds upon 
[6] and [2D]. 

Theorem 1.2 (Inverse sumset theorem for F£°). Suppose that A C is a K -approxi- 
mate group for some K ^ 2. Then A is e K ° -controlled by a {genuine) finite subgroup 
of¥f. 

Theorem 1.3 (Inverse sumset theorem for Z). Suppose that A C Z is a K -approximate 
group for some K ^ 2. Then A is e -controlled by a symmetric generalized arithmetic 
progression P = {l±xi + • ■ ■ + l^Xd '■ h £ Z, ^ Lj for all 1 ^ i ^ d} with dimension 
d <: K c . 

Remark 1.4. In this paper the letter C will always denote a constant, but different 
instances of the notation may indicate different constants. The restriction K ^ 2 is 
purely a notational convenience, so that we may write K c instead of CK . 

These theorems, the background to them and their proofs are now discussed in many 
places. See, for example, the book Neither result is usually formulated in precisely 
this fashion, but simple arguments involving the covering lemmas in [2S1 Chapter 2] 
may be used to deduce the above forms from the standard ones. The proofs of the 
above two theorems extend easily to the case of bounded torsion G and torsion-free G 
respectively. It is also possible to establish a result valid for all abelian groups at once, 
and containing the above two results as special cases: see [H] for details. 

There seems to be a general feeling that the bounds in these results are not optimal, 
and the so-called Polynomial Freiman-Ruzsa conjecture (PFR) has been proposed as a 
suggestion for what might be true. 

Conjecture 1.5 (PFR over F£°). Suppose that A C F^° is a K -approximate group. 
Then A is K c -controlled by a finite subgroup. 

Conjecture 1.6 (Weak PFR over Z). Suppose that A C Z is a K -approximate group. 
Then A is e K ° {1> -controlled by a symmetric generalised arithmetic progression P = 
{l\X\ + • ■ ■ + IdXd ■ \k\ ^ Li} with dimension d ^ K o( ^\ where oil) denotes a quantity 
bounded in magnitude by c{K) for some function c of K that goes to zero as K — > oo. 

Conjecture 11.51 has been stated in several places, and in the article [TU] unpublished 
work of Ruzsa was discussed, establishing a number of equivalent forms of it. According 
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to Ruzsa [20], the first person to make a conjecture equivalent to the PFR over 
was Katalin Marton. Conjecture 11.61 concerning approximate subgroups of Z, does 
not to our knowledge appear explicitly in the literature, although something close to 
it was suggested by Gowers [9]. One might very optimistically conjecture that a K- 
approximate subgroup of Z is K c -controlled by the affine image of the set of lattice 
points inside a convex body of dimension O (log .ft"). Such a conjecture might deserve 
to be called the PFR over Z (rather than the weak PFR), since it is nontrivial even if 
K is a suitably small power of \A\. A number of issues are rather unclear concerning 
such a formulation, one of them being whether it suffices to consider boxes rather than 
arbitrary convex bodies. This question appears to involve somewhat subtle issues from 
convex geometry and we will not consider it, or indeed any aspect of the stronger version 
of the PFR over Z, any further in this paper. 

approximate polynomials We turn now to what appears to be a completely 
unrelated topic. Let G = (G, +) be a finite abelian group, and recall the definition of 
the Gowers norms. If / : G — > C is a function we define 

WfWuHG) := (^,heGf(x)f(x + h))^ 2 

WfWuHG) ■= (E xMM£G f(x)f(x + h 1 )f(x + h 2 )f(x + h, + h 2 )) l ' A 
\\f\\u*(G) := ( E 'x,h 1 MMeGf(x)f(x + h)f(x + h 2 )f(x + h 3 )x 

x f(x + h + h 2 )f(x + h + h 3 )f(x + h 2 + h^fix + h + ^ + h))^ 8 

and so forth, where we use the averaging notation E, xe Af(x) := pr XLeA f( x )- ^ n this 
paper we shall be working primarily with the [7 3 (G)-norm. It is clear that if ||/||oo ^ 1 
and 1 1 /H c/3(g) = 1 then we necessarily have f(x) = e(<p(x)), where : G — » R/Z is 
a quadratic polynomial in the sense that Ah 2 Ah 3 (f>(x) = for all hi,h 2 ,h 3 ,x G G, 
where A^0(x) := <fi(x + h) — 4>{x). To justify the terminology, observe that when 
G = Z/iVZ with iV odd it is an easy matter to check that any quadratic polynomial has 
the form <f>(x) = j^ax 2 + j^bx + c for a, b G Z/NZ and c G R/Z, where ^ : Z/NZ -> R/Z 
is the usual embedding. 

The inverse problem for the Gowers U 3 -norm asks what can be said about functions 
/ : G — ► C for which ||/||oo ^ 1 an d ||/||c/ 3 (G) ^ 1/K. In view of the above discussion, 
it is reasonable to call such functions / K -approximate quadratics. 

The analogue of control in this setting is correlation. We say that a function / : 
G — » C 5-correlates with another function F : G — > C if the inner product (f,F) : = 
K xe Gf(x)F(x) is at least S. 

In the finite field setting, the following inverse theorem was shown in [23] . 
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Theorem 1.7 (Inverse theorem for U 3 (¥%)). Suppose that f : F£ — > C «s a K- 

approximate quadratic for some K ^ 2. T/ien / exp(—K c )- correlates with a phase 
(— 1)^ /or some quadratic polynomial ip : F£ — > F 2 . 

Remark. The phase ^(x) m ay be written explicitly, relative to a basis, as -^(x) : = 
x ■ Mx + b ■ x + c, where M : F 2 — > F 2 is a linear transformation, b G F 2 , c G F 2 , and 
6 • x is the usual dot product in F 2 . 



In Z/iVZ there is an analogous result, which we recall in Theorem 11.91 below. To 
state it we recall some of the terminology from [13] concerning nilsequences. 

Definition 1.8 (Nilsequences). A 2-step nilmanifold is a homogeneous space G/T, 
where G is a nilpotent Lie groups of step at most 2, and T is a discrete cocompact 
subgroup. A fundamental 2-step nilmanifold is one of the following three examples of a 
2-step nilmanifold: 



We place smooth metrics on each of these nilmanifolds; the exact choice of metric is not 
important. An elementary 2-step nilmanifold is a Cartesian product of finitely many 
fundamental 2-step nilmanifolds, with the product metric. Again, the exact convention 
for defining the product metric is not important. An elementary 2-step nilsequence is 
a sequence of the form n \— > F(g n xo), where G/T is an elementary 2-step nilmanifold, 
F : G/T — ► C is a Lipschitz function, g G G, and Xq G T. 

Remarks. If one only had the unit circle and not the skew torus and Heisenberg 
nilmanifold, the notion of an elementary 2-step nilsequence would collapse to that of a 
quasiperiodic sequence. It is not hard to see that the unit circle and skew torus can be 
embedded into the Heisenberg nilmanifold, and so one may work entirely with products 
of Heisenberg nilmanifolds if one wished. For further discussion of nilsequences see 



Theorem 1.9 (Inverse theorem for [/^(Z/iVZ)). Suppose that f : Z/iVZ — >• C is a 
K -approximate quadratic for some K ^ 2. Then f exp(— K c ) -correlates with an ele- 
mentary 2-step nilsequence F(g n xo), where F : G/T — » C is Lipschitz with Lipschitz 
constant at most exp(K c ) , g G G, x G G/T and G/T is an elementary 2-step nilsystem 
of dimension at most K° . 

Remarks. The proofs of Theorems 11.71 and 11.91 depend very heavily on earlier work 
of Gowers [S]. In Theorem 11.91 one can replace the notion of an elementary 2-step 



• (Unit circle) G = K and T = Z. 




[DEI E31 [TEH?]. 
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nilsequence n t— > F(g n xo) by the more concrete notion of a bracket phase polynomial 



where aj, (3j, jj, Sk,r]k G M, {x} is the fractional part of x (defined to lie in (—1/2, 1/2]), 
and d, d' are integers of size at most K c . See [21 [13] for further discussion. 

Once again, it is not generally thought that the bounds in these two results are best 
possible. The following two conjectures might be referred to as the Polynomial inverse 
conjectures for the U 3 Gowers norms, or PGI(3) for short. 

Conjecture 1.10 (PGI(3) over Fg). Suppose that f : — > C is a K -approximate 
quadratic. Then f K~ c -correlates with a quadratic phase (—1)^. 

Conjecture 1.11 (Weak PGI(3) over Z/NZ). Suppose that f : Z/NZ -> C is a K- 
approximate quadratic. Then f ex^^—K ^)- correlates with an elementary 2-step nilse- 
quence F(g n Xo), where F : G/T — ► C is Lipschitz of order at most exp(i^°( 1 ^) ; g G G, 
Xo G G/T and G/T is an elementary 2-step nilsystem of of dimension at most K olyl \ 

Remarks. The second of these conjectures deserves some comment. Usually, when 
inverse conjectures for the Gowers norms are discussed (for example in [15]) there is no 
restriction to elementary nilsequences. We have made this restriction here to simplify 
the discussion, and in particular to avoid the need to involve the quantitative theory of 
2-step nilmanifolds in general as was done in the first two sections of [16] . However it 
transpires that Conj ecture II . 1 II is implied by the same conjecture without the restriction 
to elementary nilsequences, simply because every 2-step nilsequence may be closely 
approximated by a weighted sum of elementary 2-step nilsequences. We omit the details 
of this deduction, which can be obtained from the calculations in Appendix B of [12] . 

We do not dare, at this stage, to even formulate a strong PGI(3) conjecture over 
Z/NZ. To do so would appear to involve rather subtle issues connected with the exact 
definition of complexity of a nilsequence. 

We are now in a position to state our main results. 

Theorem 1.12 (Equivalence of PFR and PGI(3), finite field version). Conjecture \l.S\ 
and Conjecture \1.1(A are equivalent. 

Remark. A similar result would hold over ¥ p , for any fixed prime p, though the 
exponents obtained would depend on p. 



d 



d! 




(1.1) 



Theorem 1.13 (Equivalence of PFR and PGI(3), Z- version). Conjecture \1.6i and Con- 
jecture \1.11\ are equivalent. 
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The fact that Conjecture 11.51 implies Conjecture 11.101 follows by a modification of 
Samorodnitsky's argument [23], and similarly the fact that Conjecture 11.61 implies Con- 
jecture [TTTT] follows from modification of [13J. Both arguments are strongly dependent 
on the work of Gowers mentioned earlier. The details of these deductions are a little 
technical and are discussed in Appendix [A] However, the main novelty of our paper lies 
in the opposite implications PGI(3) => PFR, the discussion of which forms the main 
body of this paper. 

Remark. The methods used to prove Theorems 11.121 11.131 also establish an equiva- 
lence between Theorem 11.21 and Theorem ll.7[ and between Theorem 11.31 and Theorem 



11.91 though such an equivalence is redundant given that all four theorems have already 
been proven in the literature. 

Let us conclude by remarking that Shachar Lovett informed us that he independently 
observed Theorem 11.121 



We turn now to the proof that Conjecture 11.51 implies Conjecture 11.101 that is to say 
the PGI(3) implies the PFR over the finite field Fg. The argument proceeds via the 
following intermediate result concerning the structure of approximate homomorphisms 
on the infinite vector space F£° := [J n 

Lemma 2.1 (Approximate homomorphisms). Assume Conjecture \1.1(K Suppose that 
S C F2 is a set of cardinality o1 n for some < a < 1/2, and that : S — > Fif 
is a Freiman homomorphism on S, i.e. <p{x{) + 0(2:2) = 0(2:3) + 0(2:4) whenever 
Xi,X2,X3,X4 G S are such that X\ + 22 = 23 + 2:4. Then there is an affine linear 
map ip : F2 — > FSf 3 such that <f>(x) = ip(x) for at least a c 1 n values of x G S . 

Remark. By combining this lemma with known additive-combinatorial results one 
could obtain the conclusion of Lemma 12.11 under a priori weaker assumptions, for ex- 
ample that P(0(xi) + 0(2:2) = 0(2:3) + 0(2:4) l^i + %2 = %3 + 2:4) is large. Indeed a map 
of this type restricts to a Freiman homomorphism on a large set S by arguments of 
Gowers and Ruzsa (see 0, Section 7]). 

Let us first show how Conjecture II .61 follows from Lemma I2TT1 Suppose that A C FJf 
is a /^-approximate group, and let n be minimal such that there exists a linear map 
7r : Fif — > Fg which is a Freiman isomorphism^ from A to tt(A); this quantity n, which 
one can view as a sort of "rank" or "dimension" for A, is finite since A is finite. If there is 

1 A Freiman isomorphism is a Freiman homomorphism which is invertible and whose inverse is also a 
Freiman homomorphism. 



2. The finite field case 
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some§|:r G F2\47r(y4) then we could compose ir with the projection map ip : F2 — > F£/ (#) 
to obtain a linear map 7r : FSf -> F^ -1 which is a Freiman isomorphism when restricted 
to A, contrary to the assumed minimality of n. It follows that A%(A) = F£. But tt(A) is 
Freiman isomorphic to A, which is a K- approximate group. It follows that the doubling 
constant |27r(yl)|/|7r(^4)| is at most K, and hence by Ruzsa's sumset estimates (cf. [26| 
Corollary 2.23]) that 2" = |4tt(A)| K°\A\. 

What we have done here is find a "dense model" tt(A) C F2 of the set A; the simple 
argument we used to do so is the finite field analogue of an argument of Ruzsa [20] that 
we shall recall later in the paper. Write S = n(A) and for the inverse of tt, restricted to 
S. Then is a Freiman homomorphism on S and the set A is precisely the image 4>(S). 
Applying Lemma [2.11 we see that at least X |A| of the elements of A are contained 
in a coset of an n- dimensional subspace H ^ FSf. Finally, it follows immediately from 
standard covering lemmas (cf. [26j Section 2.4]) that A is i^^-controlled by H. □ 

It remains, then, to establish Lemma 12.11 The key observation linking Freiman 
homomorphisms to approximate quadratics is the following lemma. 

Lemma 2.2. Suppose that S C ¥ 2 is a set of size a2 n for some < a < 1/2 and 
that <f> : S —* FJf is a Freiman homomorphism. The image of <p certainly lies in 
some finite- dimensional subspace F^. If f : ¥'2 +N — * [—1, 1] is the function f(x,y) : = 
l s (x)(-l)«*», then \\f\\ uHn+N) ^a. 

Proof. Consider a parallelopiped (x + u ■ h,y + u ■ A:)o;e{o,i} 3 i n the support of /, 
where h = (hi, h 2 , h 3 ), k = (ki, k^), x, h%, h 2 , h 3 G F2 and y, k±, k 2 , k 3 G F^. Then 
x + uj ■ h G S for all u G {0, l} 3 . Since is a Freiman homomorphism on S, we see that 
(p(x + lo ■ h) depends linearly on u, and so <f>{x + to ■ h) ■ (y + to ■ k) depends quadratically 
on uj. Since {0, l} 3 is three-dimensional, we conclude that 

{-l) H <P{x + uj-h)-{y + uj-k) = 

we{o,i} 3 

where |o;| is the number of Is in the coefficients of u (actually, as we are working in F 2 
here, the (—1)'^' factor could in fact be ignored). From this and the definition of / and 
the U 3 (¥ 2 +N ) norm we conclude that 

\\f\\u 3 {F%+ N ) = \\ 1 s\\u»(F^+ N )- 

The behaviour in the y index is now trivial, and therefore 

II 1 s|Ic/3(fj+ jv ) = l|ls||t^^5)- 

2 We use kA = A + . . . + A to denote the fc-fold iterated sumset of A, thus Att(A) = ir(A) + n(A) + 
tt(A) + tt(A). Note in F2 that there is no distinction between sums and differences, thus for instance 
Att{A) = 2it{A) - 2%{A). 
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Meanwhile, ||ls , ||c/ 1 (F5) ^ o~ by hypothesis. The claim now follows from the monotonicity 
of the Gowers norms (see, for example, [2111 eq. H-7]). □ 

Now suppose S, a are as in the statement of Lemma I2TT1 and let N and / be as in the 
above lemma. Assuming Conjecture 11.101 for this choice of /, there exists a quadratic 
polynomial * : ¥% +N -> F 2 such that 

\E^E^l 8 {x)(-l)^{-l)^ > °° ■ 
Thus, for at least ^ a c 2 n values of x 6 S, one has 

KjeF?(-^ {x) ' y (-lf {x ' y) \ >o c . (2.1) 
Let us fix x so that (12. ip holds. We may split ty(x,y) as 

(z, y) = *(0, j/) + 0) - *(0, 0) + y) (2.2) 
where 5 is the "mixed derivative" of defined as 

B(x, 2/) := (a;, y) - *{x, 0) - (0, y) + (0, 0). 
From ( 12. ip it thus follows that 




As \P is quadratic, 5 is bilinear in x and y, and hence -B(x, y) = ip(x) ■ y for some linear 
map %j) : F^ -> F£. We conclude that 

|Ej /eF ^(-l) Wa;) " V ' (:E)) " J (-l)' I ' (0 * y) | > 

which means that the function ?/ i— > (— has a Fourier coefficient of size at least 
<j c at 4>(x) — tp(x). Hence by Plancherel's theorem the number of such large Fourier 
coefficients is at most o~~ 2C . We conclude that <p(x) — ip(x) takes at most o~~ 2C values 
on at least a c 2 n values of x G S, and the claim follows from the pigeonhole principle .□ 

3. The integer case 

We turn now to the proof that Conj ecture 11.111 implies Conjecture II .51 This argument 
goes along similar lines to that in the previous section, but is somewhat more involved 
since one must deal with nilsequences rather than quadratic forms. We present the 
argument in such a way as to emphasise the close parallels with the preceding section. 

Once again matters rest on a reduction to an inverse theorem for approximate ho- 
momorphisms. We write [N] for the set {1, . . . , N}. 

Lemma 3.1 (Approximate homomorphisms) . Assume Conjecture \1.11[ Suppose that 
N is a positive integer, that S C [N] is a set of cardinality o~N , and that <fi : S — > Z 
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is a Freiman homomorphism on S. Then there is a generalised arithmetic progression 



The proof that this lemma implies Conjecture 11.51 is not particularly onerous and 
goes along much the same lines as the argument in the previous section. Supposing 
that A C Z is a i^-approximate subgroup, Ruzsa's "model lemma" pj2, Theorem 2] 
implies that there is a N ^ i^AI together with a subset A' C A of cardinality at least 
\A\/2 and a Freiman isomorphism tc : A' — ► S to a subset 5 C [iV]. Write <fi := 7r _1 , 
and observe that <ft : 5 1 — > Z has image 0(5*) = A'. Noting that |5*| ^ K~ C N, it follows 
from Lemma 13.11 and the fact that Freiman isomorphisms take generalised progressions 
to generalised progressions (see [26], Proposition 5.24]) that at least exp(— i^ 1 ))^ of 
A is contained in a generalised progression in Z of dimension and cardinality at 

most iV ^ K \A\. Once again, standard covering arguments complete the deduction of 
Conjecture 11.51 □ 

It remains to prove Lemma 13.11 The starting point is the following analogue of 
Lemma 12.21 showing how to convert Freiman homomorphisms to approximate quadrat- 
ics. 

Lemma 3.2. Let N, M ^ 1 be integers, let S C [N] be such that \S\ ^ o~N, and let 
cf) : S — ► Z/MZ be a Freiman homomorphism. Define a function f : Z/4iVMZ — > C by 



Proof. Every parallelepiped in the support of / is of the form (x + u ■ h, AN(y + uj ■ 
fe))o;e{o,i} 3 > where y G Z/MZ, k = (hi, k 2 , k 3 ) G (Z/MZ) 3 , and x + u ■ h G S. By arguing 
exactly as in Lemma 12.21 we have that 



P C [N] of dimension a °W and size at least exp(— a °^)N together with a Freiman 
homomorphism ip : P — > Z suc/i £/iat </>(#) = ^(x) for at least exp(—a~°^)N values of 



x G S. 




where bm{x) '■= e 2mx l M ; and ANy G Z/4iVMZ is defined in the obvious manner for 
y G Z/MZ. TTien ||/||{/3 (z/4A r MZ) ^ |o\ 



^ (-1) m 0(x + ^-/i)(?/ + cj- A;) = 




1 



s\\U 1 (Z./4:NMZ) ^ Cr/4 



and the claim follows as before from the monotonicity of the Gowers norms. 



□ 
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We return now to the proof of Lemma 13.11 That lemma deals with Freiman homo- 
morphisms <fi : S — > Z. However such a map is a Freiman homomorphism if and only if 
the composition ttm ° 4> is a Freiman homomorphism for all sufficiently large M, and so 
we may suppose instead that <fi maps S to Z/MZ for some M. 

Let / be as in Lemma 13.21 Assuming Conjecture II. Ill it follows that there is some 
elementary 2-step nilmanifold G/Y of dimension at most a function F : G /Y ^ £ 

of Lipschitz constant at most exp(cr _o( ^ 1 ^), g E G, and x$ E G/Y such that 

Ke[N]^em^s(x)e M ^(x)y)F{g x ^^x )\ > exp(-a-°«). 

Writing xq = goY for some go E G of distance at most exp(a~°^) from the origin, 
and rewriting g x+iNy xo = gog x+4:Ny Y where g := g ~ 1 ggo, we see (after shifting F by go 
and replacing g by g if necessary) that we may normalise x$ to be at the origin Y. By 
embedding the skew torus in the Heisenberg group if necessary we may take G to be a 
product of Heisenberg groups and hence, in particular, connected and simply-connected. 

The vertical torus [G,G]/(Y n [G, G]) of the elementary 2-step nilmanifold can be 
identified with a torus (IR/Z)^ 2 for some d 2 ^ a^ ^\ By standard harmonic analysis ar- 
guments (see, for example, [12] Lemma A. 9]) the Lipschitz function may be decomposed 
into a linear combination of at most exp(er - °W) Fourier characters along the vertical di- 
rection with coefficients of magnitude at most exp(a~°^), plus an error of exp(— a~°^) 
in L°°. Applying the pigeonhole principle it follows that one may assume that F is a 
vertical character, which means that there exists a character x '■ [G, G]/(Yn [G, G]) — > S 1 
such that 

F(g 2 x)= X (92)F(x) (3.1) 

for all x E G/Y and g 2 E [G, G] (where we lift x to [G, G] in the obvious fashion). 

The Lipschitz function \F\is now invariant under the action of the vertical torus and 
descends to a function on the horizontal torus G/Y[G, G], which can be identified with 
a torus (M./Z) dl for some d\ ^ cr -0 ^). By applying a Lipschitz partition of unity we 
may assume that \F\ (and hence F) is supported in a small ball in this torus, of radius 
less than exp(— a~°^) say. 

By the pigeonhole principle, we can now find ^ exp(— a~ o( - 1 ' l )N values of x E S such 
that 

\E ye[M] e M (<j ) (x)y)F(g x+m yY)\ > exp(-a~°^). 

By pigeonholing in x (reducing the number of available a; by a factor of exp(— a~°^)), 
we may assume that for all these x the point g x Y lies in a small ball B in G/Y, of radius 
less than exp(cr~°W). 

We turn now to the task of simplifying F(g x+4:Ny Y): this may be thought of, roughly, 
as a quest to find a suitable analogue for the decomposition (12. 2p . To begin with let us 
expand g x as {g x } \_g x \ , where [^J € Y and {g x } lies in a fundamental domain of G/Y 
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that contains B in its interioj^. As usual, write [g, h] := ghg^h^ 1 for the commutator 
of two elements g and h in some ambient group. Now in any 2-step nilpotent group G 
we have [x n , y] = [x, y] n for all x, y G G and all fiGZ: this follows from the commutator 
identity [xy, z] = [y, z] x [x, z], which is valid in all groups. It follows that 

9 x+4Ny T = g 4Ny {g x }Y = [g iN , {g x }} y {g x }g 4Ny Y. 

Since F is a vertical character, we thus see that 

F{g x+iNy T) = x{[g iN A9 x }]) y F{{g x }g iNy T) 

and so 

|E, e[ M ]e ([^(*) -1>(x)]v)F(W}g Wv r)\ > exp(-a-°( 1 )) 
for at least exp(— a~°^)N values of x G S, where if)(x) G R/Z is the phase such that 

x{[g m Ag x }]) = e{Hx)). 

By construction, {g x } is supported in a small ball centred at some go G G, of radius 
less than exp(— a~°^). Provided that this ball is chosen small enough, the Lipschitz 
nature of F guarantees that 

Ke[M]e([^(x) - ij(x)}y)F(g g m yT)\ > exp(-a-^). 

Recall that \F\ has small support, on account of the partition of unity that was brought 
into play earlier in the argument. We now let Fq : G/Y — > C be another function of Lip- 
schitz constant at most exp(cr _0 ^ 1 ^) and with vertical character \ which has magnitude 
1 on the support of F(g -); there are no topological obstructions to building such an F 
if the support of \F\ is small enough (think, for example, of the function ip(x,y, z)e(z) 
on the Heisenberg nilmanifold G/Y, where ip is supported on a small ball and equals 1 
on a very small ball about the origin in the fundamental domain {— |, |}). 
With this function F constructed we may write 

F(g g m yY) = F(^ 4 ^r)F (^ 4 ^r) 

where F : G/Y — > C is the function 

F(x) := F{g x)T {x). 

Observe that the function F(x) is invariant under the action of the vertical torus, and 
thus descends to a function on (R/Z) dl , which by abuse of notation we also call F. Thus 

F(g g m VY) = F(vr(^r))F (^r), 

where tt : G/Y — > (IR/Z) a!l is the projection onto the horizontal torus. 

3 Several papers of the authors - for example the appendix of [12j - contain example computations of 
{g x } and [g x \ on the Heisenberg group for fundamental domains like { — i, ^} or [0, l] 3 . 
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The function F is Lipschitz with constant at most exp(a~°^), and so (by [T2"j 
Lemma A. 9]) can be decomposed into a combination of at most exp(<7~° ( ' 1 * ) ) characters 
with coefficients at most exp(cr _ °( 1 ^), plus an error of size exp(— a^°^). Meanwhile, 
ft (9 y T) G (M/Z) dl evolves linearly in y. By the pigeonhole principle, refining the set 
of available x some more, we may thus assume that 

Ke[M]e([^<f>(x) -^(x)}y)e(^y)Fo(g 4Ny T)\ > exp(-a-°«). (3.2) 

for some £0 £ R/Z independent of x, and for at least exp(— a~°^)N values of x. Thus, 
the function y \— > FQ(g 4Ny T) has a large Fourier coefficient at j^4>(x) — ip(x) + £o- 

In the finite field argument we applied Plancherel's theorem at this point. Here the 
appropriate tool is the large sieve, a kind of approximate version of Plancherel which 
states that a function / : [M] — > C cannot have large Fourier coefficients at many 
separated points. The following (standard) statement of it may be found in [5j Ch. 27]: 
if the points #i, R/Z are (^-separated then 

K 

El E f(vWi)\ 2 <(M + r l ) E \f(y)\ 2 - 

3=1 ye[M] ye[M] 

Applying this to (13. 2\\ and the remark following it, we see that the large Fourier coef- 
ficients Tj(j>{x) — ip(x) + £0 of the function y \— > F (g 4Ar:y r) can be covered by at most 
exp(er~°W) arcs of length 1/Mon the unit circle R/Z. Pigeonholing, and refining the 
set of x by yet another factor of exp(— a~°^), we may assume that j^(p(x) — ip(x) + £0 
lies inside a fixed arc of length 1Q ^ M . This implies, refining the set of x one more time, 
that we may find a £1 G R/Z such that for at least exp(— a~°^)N values of x G S, 
jj(j)(x) - ip(x) + £1 G [-1/100M, 1/100M]. 

By direct computations on the Heisenberg group along the lines of those in [12] we 
see that Tr({g x }) = (acix, . . . , a^x) for some ax, ... , G R/Z, and then that 

)(([9 W K}])=e£ft{^-7 J }) 
3=1 

for some /3j, jj G R/Z independent of x. Here the fractional part {£} of t G R is chosen 
to lie in (— |, |], and the need for the shift 7j arises from the fact that {g x } is chosen 
to lie in a fundamental domain of G/T containing B in its interior. 
This means, of course, that 

di 

3=1 

The set of all x G [N] such that %(g x T) lies within exp(— a^°^) of the origin is a 
Bohr set of rank at most a~°^ and radius at least exp(— a~ ^), and hence by [20, 
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Theorem 3.1] (reproduced as Theorem IB.2I in the appendix) it contains a proper sym- 
metric generalised arithmetic progression P of dimension at most o~~°^ and cardinality 
at least exp(— o~°^)N . By discarding generators of P if necessary we may assume 
that all sidelengths of P are at least Co for some constant Co to be specified later. 
By standard covering lemmas such as [26J Lemma 2.14] we may cover [N] by at most 
exp(a~°^ N) translates of P, so by the pigeonhole principle we may assume that all the 
x under discussion, that is to say those x for which ajX ~ jj, are contained in a single 
translate Xq + P of P. Note that each map x \— > {ajX — jj} is a Freiman homomorphism 
on Xo + P and hence so is the entire phase if). 

If we let Q be the set of all x G x + P such that ip{x) — £i lies within ^ M of a 
multiple jj4>(x) of -h, where <j>(x) G Z/MZ, then we conclude upon rounding to the 
nearest multiple of jr that is a Freiman homomorphism on Q. Also, from construction 
we see that <f>{x) = <f>(x) for at least exp(— a~°^)N values of x e S C\Q. 

To conclude the argument one needs to show that Q contains a generalised arithmetic 
progression of dimension at most a^°^ and cardinality at least exp(— a~°^)N (since 
one can then cover Q by at most exp(cr~°^ 1 ^) translates of such a progression). This 
will follow straightforwardly from the following lemma which, though it looks to be of 
a standard type, does not appear to be in the literature. A proof may be found in 
Appendix [Bi 

Lemma IB. 11 Let e G (0,1/2) be a real number. Suppose that P is a d- dimensional 
proper progression with sidelengths Ni,...,Nd > C/e and that rj : P — > R/Z is a 
Freiman homomorphism which vanishes at some point of P. Then the set {x G P : 
||^(^)||r/z ^ contains a progression of dimension at most d + 1 and size at least 
{Cd)- d s d+1 \P\. 

We shall apply the lemma with e = 1/100, this being valid if the constant Co was 
chosen to be large enough earlier on. Recall that there are many x G S fl (xo + P) such 
that j } (f>(x)-ip(x) + ^ 1 G [-1/100M, 1/100M]. Pick one such x*, and take £ 2 to be such 
that jj<p(x*)—ip(x*)+^2 = and ||£i — 62IU/Z ^ 1/100M. Now we simply apply Lemma 
IB. II to the progression Xo + P, taking n = M{ip — £ 2 ) and e = 1/100. The progression 
Q contains the set {x G x + P : ||77(x)||m/z ^ 1/100}, and of course 77 vanishes at 
x*. It follows from Lemma lB.ll that Q does indeed contain a generalised arithmetic 
progression of dimension at most a~°^ and cardinality at least exp(— a~ o( -^)N, and 




□ 



4. Higher order correspondences 



It appears that the correspondence between inverse sumset theorems and inverse 
conjectures for the Gowers norms have some partial higher order analogues, although 
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the situation here is much less well understood. To illustrate this phenomenon, consider 
the following result, recently proven in [3J, [2Zj- Here and for the rest of the section we 
write F := F 5 for definiteness, although the same arguments would work for ¥ p for any 
fixed prime p ^ 5. There are definite issues in extremely low characteristic: see for 
example [Til [18] . 

Theorem 4.1 (GI(4) over F n ). For every K ^ 2 there exists an e > such that if 
f : F n — * C is a K -approximate cubic in the sense that ||/||oo ^ 1 and ||/||e/ 4 (f™) ^ 1/K , 
then f e-correlates with a {genuine) cubic phase e ¥ (ip), where e ¥ (x) := e 27nX// ' F ' and 
ip : F n — > F is cubic in the sense that A/^ . . . A/j 4 ^(x) = for all x, hi, . . . , h^ G F n . 

We shall use this theorem to establish the following variant of Lemma 12.11 

Proposition 4.2 (Approximate quadratic homomorphisms) . Suppose that a G (0, 1/2), 
that S C F n is a set of cardinality cr|F| n ; and that <fi : S — > F°° is a Freiman quadratic 
on S in the sense that X^e{o i} 3 ( — + h ■ uS) =0 whenever x G F n and h = 
(hi, h2, h%) with hi, /12, h% G F™ are such that x + u ■ h G S . Then there is a quadratic 
map if) : F n — > F°° such that <p(x) = if>(x) for at least e\¥\ n values of x G S , where 
e = e(cr) > depends only on a. 

The initial stages of the proof are very similar to those of Theorem 11.121 and we 
just sketch them. As before, we let N be large enough that takes values in F^, and 
considers the function / : ¥ n+N — > C defined by f(x,y) := ls(x)e^((f>(x) ■ y). A routine 
modification of Lemma 12.21 reveals that 

and thus by Theorem 14.11 we can find a cubic ^ : ¥ n+N — > F such that 

\E xe¥ nE ye¥ N l 5 (x)e F (0(x) • y - ^(x, y)) | >a 1, 

where here we use X ^>s Y to denote the estimate X ^ C^Y for some Cs depending 
only on 5. Thus for ^>s |F| n values of x G S, one has 

\E ye¥ Ne ¥ ((f)(x) ■ y - ^(x,y))\ ><5 1. 

The next step is to perform a decomposition of ^ analogous to (12.21) . but unfortunately 
the analogous decomposition is not so favourable. Namely, one has 

y) = (0, y) + Q x (y) + if)(x) ■ y + P(x) 

where Q x : ¥ N ->• F is a quadratic polynomial that varies affine-linearly in x, ip :¥ n 
¥ N is a quadratic polynomial, and P : ¥ n — > ¥ is a cubic polynomial. We thus have 

\E ye¥N e ¥ ((<P(x) - ifj{x)) ■ y - Q x (y) - (0, y))| > 5 1 (4.1) 
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for ~^>$ |F| n values of x G S. 

The factor of e ¥ (-Q x (y)) in the functions f x (y) := e ¥ ((<p(x) - ip{x)) ■ y - Q x (y)) 
prevents one from immediately using Plancherel's theorem as in Section [21 However, 
from standard Gauss sum estimates (see e.g. [Ml Lemma 1.6]) we do have 

|(/„/,')l«|Fr rk(Q ^' )/2 (4.2) 

for any x, x'. Here the rank of a quadratic form Q can be defined as the rank of the 
symmetric matrix describing the homogeneous part of Q. By standard linear algebra 
there is a vector subspace Vq ^ F n with dim(Vg) = rk(Q) such that Q(y) is a quadratic 
function of the inner products (v,y), v G Vq. 

From (14.21) and a standard duality argument related to the large sieve (see, for 
example, [3, Ch. 27, Theorem 1]) one can show that there cannot exist k different 
Xi,...,Xk G S obeying (14. ip with rk(Q Xt — Q x .) ^ k, if k is large enough depending 
on 5. By the greedy algorithm, we may thus find xi,...,Xk with k <C5 1 such that 
minx^j^fc rk(Qa; — Q Xi ) 1 for all x obeying (14.11) . By pigeonholing in the x parame- 
ter, we conclude that there exists a quadratic form Q Xl such that rk(Q x — Q Xl ) <C<5 1 
for ~^>s |F| n values of x G S. By translating we may normalise and take x\ = 0. 

Write Q' x be the homogeneous quadratic component of Q x — Qo, so that Q' x depends 
linearly on x and ik{Q' x ) <C<5 1 for ^>s |F| n values of x G S. Key to our argument is the 
following proposition concerning this situation, which may be of independent interest. 
It states that a linear function to the set of low-rank quadratics must, in a sense, be 
quite trivial. 

Proposition 4.3 (Triviality of linearly varying low-rank quadratic forms). Let r G N 

and suppose that e G (0, 1] is a real number. Suppose that x t— > Q x is a linear map from 
¥ n to the space of homogeneous quadratics over ¥ N . For each such form Q x associate 
the vector space V x := Vq x . Suppose that there is a set A of at least a\¥\ n values of x 
for which ik(Q' x ) ^ r. Then there is some vector space V F n ; dim(K) ^ r, such that 
Vx Q V for at least a' (a, r)|F| n values of x G A, where a : (0, 1] x No — > M takes positive 
values. 

Proof. We claim that under the stated hypotheses there is some vector v which lies 
in at least ao(a,r)|F| n of the spaces V x , where ao is a function taking positive values. 
The proposition then follows quickly by induction on r, upon passing to a coset of the 
codimension one subspace v ^ F ra which contains at least alt" -1 ! elements of A. 

Now by a standard application of Cauchy-Schwarz (see, e.g, [26], Corollary 2.10]) there 
are at least a 4 |F| 3ri additive quadruples in A, that is to say quadruples (xi, x 2 , x 3 , x 4 ) G 
A 4 with X1 + X2 = X3 + X4. We say that such a quadruple is gooci if V Xi n(V x . + V Xk ) = {0} 
for all 24 choices of distinct k G {1, 2, 3, 4}. 
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Case 1. At least half of the additive quadruples in A are good. Fix a good quadruple 
(xi, x 2 , x 3 , x 4 ) G A 4 . Let y,h,k G F n be arbitrary, and select h! G (h + Vj-J n (Vj~ 2 n V^) 
and fc' G (/c + V^) fl Vj^. Straightforward linear algebra (and the goodness of the 
quadruple (xi, x 2 , x%, X4)) confirms that this is possible. 

From the linearity of the map x 1— > Q x we have 

Q xi (y) + Qx 2 (y) - Q X3 (y) - Qx 4 (y) = o 

and 

Q Xl (y + ti) + Q X2 (y + ti) - Q X3 (y + ti) - Q X4 (y + ti) = 0. 
Since b! G V x \ fl V x ^ the second of these implies that 

QxAv + ti) + Q« 2 (y) - Q X3 (y) - Q X4 (y + /i') = 0. 
Subtracting the first equation yields 

QxM ~ Q*Av + ti) - Q X4 (y) + Q X4 (y + ti) = 0. 

Substituting y + k' for y, recalling that k! G Vj^, and subtracting, this implies that 

Q X1 (y) - Q X1 (y + ti) - Q X1 (y + k') + Q Xl (y + ti + k') = 0. 

But h — h' and k — k! both lie in V^, and so this implies that 

QxM ~ QxAv + h)~ QxAv + k) + Q Xl {y + h + k)= 0. 

Since Q X1 is a homogeneous quadratic and h, k (and y) were arbitrary, this last equation 
implies that Q X1 is in fact zero. 

Since no x can be the X\ term of more than |F| 2ri additive quadruples, it follows that 
Q x = for at least Y^,a 4 |IF| n values of x. On the other hand, the set of x where Q x = 
is a subspace of F n , and the claim is thus verified in this case. 

Case 2. At least half of the additive quadruples in A are bad. Then (for example) 
there are at least « 4 |IF| 3ri quadruples (x±, x 2 , £3, X4) G A 4 with 14 1 n(T4 2 + T4 3 ) 7^ {0}. 
Since the first three terms x±,x 2 and £3 of an additive quadruple determine the fourth, 
it follows easily that there is some choice of x 2 , x 3 such that V X1 fl (V X2 + V X3 ) 7^ {0} for 
at least y^q; 2 |F|™ values of x\. Since V X2 + V X3 is a vector space of dimension at most 
2r, the claim follows in this case with a (a,r) = ■^a 2 \¥\~ 2r . 

We have verified the claim (with a (a,r) = -^a 4 \W\~ 2r , say) in all cases and hence 
the proposition is proved. □ 

Remark. An inspection of the argument reveals that the function a'(a,r) in this 
proposition can be taken to have the form (a/C)° r . 
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Let us return now to (14.1 1) , which stated that 

\M yewN e v (((j)(x) - ip(x)) ■ y - Q x (y) - tf(0, y))\ > 5 1 

for ^>,5 |F| n values of x G S. In the subsequent discussion we passed to a further subset 
of ^>s |F| n values of x for which ik{Q x — Qq) <C<s 1 Writing Q' x for the homogeneous 
quadratic part of Q x — Q , we may use Proposition 14.31 to assert that there is some 
subspace V ^ F , dimV^ <C<5 1, such that Q' x (y) is a quadratic function of the inner 
products (v,y), v 6 V. The coefficients of this quadratic function vary linearly in x, 
but this is unimportant. 

By foliating into cosets of V- 1 , we may find a 1-bounded function F supported on 
some coset t + V ± and a quadratic polynomial ip : F n — > F^ such that 

E J/eF ivF(?/)e F ((0(x) - ^(x)) • y) > 5 1 

for 3>,5 |F| n values of x 6 S. Note that the quadratic ■?/> has been adjusted to take account 
for the possibility that Q x contains linear terms in y (which also depend affine-linearly 
on x). 

To conclude the argument we simply apply the Plancherel argument from Section [21 
This tells us that there are 1 values of r for which 

E y&N F(y)e ¥ (r ■ y) > 5 1. 

It follows from the pigeonhole principle that there is some r such that 4>(x) — ip(x) = r 
for ^>,5 |F| n values of x G S, which implies Proposition 14.21 □ 

Remark. Because of the use of the rank reduction argument in the proof of Proposi- 
tion the proof above does not seem to imply any implication between a conjectural 
polynomial version of Theorem 14.14 an d a polynomial version of Proposition 14.21 Also, 
we do not know if the implication can be reversed; the proof of Theorem 14. II in [3j [27] , 
is somewhat different from the arguments in [3 El E31 [23] , relying instead on ergodic 
theory and cohomological tools. 

Appendix A. Deduction of PGI(3) from PFR 

In this appendix we sketch how the polynomial Freiman-Ruzsa conjectures (Con- 
jectures 11.51 11.61) imply their respective polynomial inverse conjectures for the Gowers 
norms (Conjectures 11.101 11.111) . Roughly speaking, the idea is to run the arguments in 
[23J or [13] verbatim, but substituting the polynomial Freiman-Ruzsa conjectures in one 
key step of the argument where the usual inverse sumset theorems (basically, Theorem 
11.21 or 11.31 respectively) are currently used instead. It should be noted that the bulk of 
this implication is due to Gowers [S] . 
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Our sketch will be somewhat brief and in particular we will assume familiarity with 
either [23] or [T3] as appropriate. In the finite field case (i.e. the deduction of Conjecture 
11.101 from Conjecture 11.51) the modification is particularly straightforward; one simply 
repeats the argument in [23 J, but replacing [231 Theorem 6.9] (which is essentially 
Theorem II. 2ft by Conjecture 11.51 instead. To spell out the steps in a little more detail, 
suppose that K ^ 2, and let / : ¥ 2 — > C be a if-approximate quadratic: that is to say 
||/||f7 3 (F™) ^ l/K. By repeating the arguments up to and including [231 Lemma 6.7], 
one can find a function : ¥ 2 — > ¥ 2 such that the set 

{(x,y) G F™ x F 2 : <f>{x + y) = <f>(x) + <f>(y); \f x (x)\, \f y (y)\, \U y (x + y)\> K~ c } 

has density ^ K~ c inF"xF™, where f x (y) := f(x+y)J(xj and f(x) = E ye¥ nf(y)(-l) x -y 
is the usual Fourier transform. Now let 

A:={xe¥ n 2 :\f x (x)\>K- c }. 

Arguing as in [231 Section 6], but using Conjecture [L5] instead of [231 Theorem 6.9], one 
finds a linear transformation D : ¥ 2 — > ¥ 2 and zeFJ such that 4>{x) = Dx + z for a 
proportion at least cK~ c of all x G A, and thus 

F x& n\f x (Dx + z)\ 2 > K~ c . 

By modulating / by a suitable linear phase we may normalise so that z — 0. Continuing 
the argument in [231 Section 6] one concludes that the subspace U := {x G F 2 : Dx = 
D l x} of F2 has density K~ c , and so by further continuation of the argument one can 
find a symmetric transformation B : ¥ 2 — > ¥ 2 with zero diagonal coefficients such that 

E xe¥ n\f x (Bx)\ 2 >K- c . 

From the structure of B one can B = M + M l for some transformation M : ¥ 2 ^ ¥ 2 . 
A little Fourier analysis then shows that the function (— l) x ' Mx f(x) has a ?7 2 (F2) norm 
of at least K~ c , and so has an inner product of at least K~ c with a linear character, 
and Conjecture 11.101 follows. 

We turn now to the integer case, i.e. the deduction of Conjecture 11.111 from Con- 
jecture 11.61 This requires a little more modification, because the arguments in [13] 
proceeded not via inverse sumset theorems, but instead via the (closely related) device 
of Bogulybov-type theorems^. We think, in particular of [TBI Lemma 6.3]). However, as 
noted in [7], one could substitute inverse sumset theorems for Bogulybov-type theorems 
at this stage. 

We turn to the details. Let K ^ 2 and suppose that / : Z/iVZ — > C is a K- 
approximate quadratic where, for simplicity, iV is odd (this in fact implies the general 

4 It is possible that polynomial variants of these Bogolyubov-type theorems also hold, but so far as we 
know conjectures of this type are strictly stronger than Conjectures 11.51 and 11.61 
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case, an exercise we leave to the reader). Applying [131 Proposition 5.4], there is a set 
H' C Z/NZ of size \H'\ ^ K~ C N and a function £ : iZ 7 -> Z/iVZ whose graph 

r':={(A,^):/ieff'} 
is such that |9F - 8F| ^ K C N, and such that 

\m h )\^K- c 

for all /i G if", where /h(^) : = /(^ + h)f(x) as before, and /(£) := E£gz/ivz/OE)ejv(a;£) 
is the usual Fourier transform. 

Applying [131 Proposition 9.1], one obtains a regular Bohr set B 1 := B(S,p) with 
IS'I ^ K c , ^ p ^ | and £o,£ G Z/NZ, as well as a locally linear function M : 
5(5, |) -> Z/NZ such that 

E/, eBl l/f(^o + /i)l^ 0+fc =2M/ t+ « > (A.l) 

This was eventually used in [13] to deduce Theorem ll.91 An inspection of that deduction 
reveals that the argument would also work just as well if the Bohr set B(S,p) were 
replaced with a symmetric progression of dimension at most K c and cardinality at least 
exp(— K C )N . Furthermore, if one could instead replace B(S,p) with a progression of 
dimension at most K°W and cardinality at least exp(K ~°( 1 ))./V' then one could conclude 
Conjecture 11.111 instead of Theorem 11.91 Thus, our only task is to alter the argument 
of [131 Proposition 9.1], using the additional input of Conjecture 11.51 to obtain such a 
progression in place of B(S,p). 

By Conjecture ll.6i r' has large intersection with a translate of a symmetric generalised 
arithmetic progression P of dimension at most and cardinality at most e K ° W N. By 
[26, Theorem 3.40], P contains a proper symmetric generalised arithmetic progression 
P' 

P' = + ■ ■ ■ + l d x d : \k\ ^ Li} 

in Z/NZ x Z/NZ of dimension d ^ and volume at least e K ° 1 N. The progression 
P' — P' need not be a graph. However, since P 1 — P' + V C 2P — P has size at most 
e K ° {1) N } and V is a graph, we see that the intersection of P' with the vertical axis 
{0} x Z/NZ has cardinality at most e K ° {1 \ thus P' is in some sense "almost a graph" 
up to factors of e K ° W . Applying [131 Lemma 8.3] one can then find a Bohr set B(S, j) 
in Z/NZ with \S\ ^ K°M such that P - P n ({0} x B(S, |)) = {0}. In particular, the 
set P" := P' n (Z/NZ x B(S, §)) is a graph. 

One can write P" := <p(B), where B is the box {(h, . . . , Id) G Z : ^ Lj} in Z d 
and <ft : Z d —>■ (Z/NZ x Z/NZ) is the homomorphism 0('i, . . . , Id) '■= hx\ + . . . + ldX d . 
Observe that P" = (f>(B H B(S', -)) for some Bohr set B(S', |) in Z d . Applying Lemma 
IB. II |S"| times we see that B n B(S r ,%) contains a symmetric generalised arithmetic 



20 BEN GREEN AND TERENCE TAO 

progression Q of dimension at most and volume at least e~ K ° W N . By shrinking 

Q slightly we may in fact assume that Q — Q C B D B(S', |). Then <p(Q — Q) is a 
graph, or equivalently that <j)(Q) is Freiman isomorphic to its projection n((j)(Q)) to the 
first factor Z/iVZ of Z/iVZ x Z/AZ. Since P' was proper, we see that ir{4>{Q)) is also 
proper. We then conclude that 

<f>(Q) = {(x,Mx + $):xe tt(0(Q))} 

where £ G Z/JVZ, and M : vr(0(Q)) -> Z/iVZ is locally linear. 

As Q is a progression, we can find Q' — Q' inside Q where Q' C Q is another 
progression with dimension at most and cardinality at least e~ K ° W N . The set 

4>{Q') has relative density at least e~ K ° (1) inside P, which has a doubling constant of at 
most e K ° {1> , so by standard covering lemma arguments (see e.g. [26], Lemma 2.14]) one 
can cover P by at most e K ° 1 translates of cj>(Q') — <j){Q') C (f>(Q)- In particular, by the 
pigeonhole principle, some translate of </>(Q) intersects T' in at least e~ K ° {1) N points. If 
one then repeats the arguments used to prove [TUl Proposition 9.1] one obtains what 
was claimed, namely an analogue of (1A.1I) with B(S,p) replaced by a progression of 
dimension and size at least exp(— K ^')N. 

Appendix B. Bohr sets in generalised progressions 

The aim of this appendix is to prove Lemma IB.1[ the statement of which was as 
follows. 

Lemma B.l. Let e G (0,1/2) be a real number. Suppose that P is a d- dimensional 
proper progression with sidelengths Ni,...,Nd > C/e and that rj : P — ► R/Z is a 
Freiman homomorphism which vanishes at some point of P. Then the set {x G P : 
H^O^IIr/z ^ 6 } contains a progression of dimension at most d + 1 and size at least 
{Cd)- d e d+1 \P\. 

Proof. The progression P is an affine image of some box [Nx] x • • • x [iVj, and the lift 
of rj to this box is an affine map of the form x — > a\X\ + • • • + ct^Xd + (3. Henceforth we 
abuse notation by identifying P with the box [Nx] x • • • x [A^]. We are told that there 
is a point x* such that r](x*) = 0. By reparametrising P if necessary, we may assume 
that x* is in the same quadrant of P as the origin, thus x* G [Ai/2] x • • ■ x [N^/2], 
It turns out to be inconvenient later on if x* is too close to the boundary of P, so we 
begin with a preliminary argument to find a point x** which is deeper in the interior of 
P than x*, and at which rj is still small. To do this consider some m :— \2/e\ + l points 
Xi, . . . , x m G P such that the jth coordinate of Xi is roughly iNj/3m. By the pigeonhole 
principle there must be some pair of indices s < t such that \\r)(x t — x s )||r/z ^ £ /2, and 
then the point x** := x* + x t — x s will have the property that all of its coordinates 
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lie between eNj/10 and (1 — e / 10) Nj (note that we implicitly used here the fact that 
Nj > C/e). 

Let us now recentre so that x** is at the origin. Since x** was chosen to be somewhat 
central to P, the progression P certainly contains the symmetric progression P' : = 
Y[j=i[— Nj, Nj] in this new coordinate system, where iVj := eNj/10. Henceforth we 
work entirely in this new coordinate system and with this new progression P' . The 
Freiman homomorphism r\ : P' — > R/Z now takes the form r](x) = aiX\ + • • • + otdXd + (3 
where ||/3||r/z ^ e/2, and we may of course assume that ^ ctj < 1 for each j. 

At this point, one could conclude the argument (with worse bounds than claimed) 
using Lemma 4.20, Lemma 4.22], because the set where r\ is small is essentially a 
Bohr set in P'. To get the sharper bounds claimed in the theorem, we use a well-known 
lemma of Ruzsa [20, Theorem 3.1], in which the structure of Bohr sets was elucidated 
using the geometry of numbers. 

Lemma B.2. Suppose that M ^ 1 is an integer, that r\, . . . ,rd are residues (modM) 
such that hcf (n, . . . , r*., M) = 1, and that e\, . . .Ed G (0, 1/2) are real numbers. Then 
the Bohr set 

B(n, . . . , r d ; ei, . . . , e d ) := G Z/MZ : ||riar/M||n/z < ei, . . . , ||r d a;/M || R / Z < e d } 

contains a d- dimensional progression {that is, the image of a box under an affine map 
from Z d to Z/MZ) of cardinality at least d~ d E\ . . . e d M . □ 

Let Mi ^ . . . ^ Md be a very large odd coprime integers and set M := Mi . . . Md- 
Set Tj := Mj + i . . . Md for j = 1, . . . , d — 1 and := 1. For each j = 1, . . . , d choose an 
integer Sj, ^ Sj < Mj, such that \sj/Mj — ctj\ ^ 1/Mj. Set r d+ i := + • • • + r d Sd- 
Finally, set Ej := Nj/2Mj for j = and Ed+i '■= e/A. Our contention is that 

the Bohr set B' = B(ri, . . . , r^ +1 ; e\, . . . , £d+i) is contained in a set which is Freiman 
isomorphic to {x G P : ||^(^)||r/z ^ £}, at which point Lemma IB .11 follows easily from 
Lemma IB. 21 To begin with we show that the Bohr set B = B(r\, . . . ,rd',£\, ■ ■ ■ ,£d) 
is contained in a Freiman-isomorphic copy of P. Suppose that x G Z/MZ lies in 
B = B(ri, . . . ,rd',£\, ■ ■ ■ ,£d)- If £ G Z/MZ, we may write 

x = xi + x 2 M 1 H h x d M l . . . M d -i 

for unique integers Xi, . . . ,Xd with \xj\ < Mj/2. Observe that 

r\x xi r 2 x xi x 2 , , 1 , 

— — ^ = -——(modi), — — = - - - - + —— mod 1), 
M Mi h M .\/,.U, M 2 
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and so on. If x £ B then these may be applied in succession to obtain \\rix/M — 
Zi/MiIIk/z = 0, 

" ~M ~ W 2 " m ^ M2' " M~ ~ M3 " M/Z ^ M 2 M 3 + M3' ^ ' ' 

and so on. If the Mj are chosen appropriately (with M\ much bigger than M 2 and so 
on) this implies that ||:Tj/Mj|| 2Ej for j = l,...,d, which implies that \xj\ ^ iVj for 
all j. 

Now we have 

n sixi s d x d si s d N[ N' d e 

ll^ + --- + ^-^)lk/, < \ W -^\\xr\+---M W -a d \\x d \ < ^ + - + ^ < g, 

(B.2) 

provided that the Mj are chosen large enough in terms of N[, . . . , N' d and e. 
Furthermore the inequalities ( IB. II) imply that \\sirix/M — siXi/Mi\\ = 0, 

,,s 2 r 2 x S 2 X 2 S 2 E! 

|| M Ma ||r/z ^ M2M - 3 \/., ^ \/ 2 +£2 ' 
and so on. Adding, we clearly obtain 

r d+1 x SjXi s d x d £ 

M Mi " ' M d l|M/Z ^ 8 

provided that the Mj are selected to be large enough. 

Combining this with flB.21) . we obtain 

n r d+ xx e 
|| — — aixt "d^dlliR/z < -, 

and hence if x G B we certainly have ||aiXiH ha^dda/z ^ e/2 and hence ||^(^)||r/z ^ 

£. □ 
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